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Abstract : Pseudo-random correlation systems find their 

application in many engineering fields like, communica- 
tions, nondestructive testing, medical imaging, and 
geophysics. The heart of such systems is a correlator, 
which performs the crosscorrelation between the received 
signal and a reference waveform. A new method of DSP- 
based correlator implementation is discussed. It exploits 
the structural characteristics of pseudo-random 
waveforms and performs the crosscorrelation of any 
digitized waveform with a reference pseudo-random 
waveform, in a manner, much more efficient in terms of 
processing speed and hardware requirements. This new 
method can be applied for baseband or bandpass 
waveforms, and it can handle a wide range of modulation 
schemes and signaling structures. In order to achieve 
greater resolution of the calculated correlation function, 
it is possible to compute the correlation function for the 
lag values in fraction of the basic chip interval. 

I. INTRODUCTION 

Pseudo-random correlation systems find their application 
in many engineering fields like, communications, 
nondestructive testing, medical imaging, and geophysics [l, 
2, 3]. They provide a convenient means to improve the 
dynamic range and the signal-to-noise ratio of the 
measurement, without reducing the resolution and without 
having to increase the peak power of the transmit waveform. 
There is, however, a price to pay in terms of greater system 
complexity and complicated signal processing. Until 
recently, the signal processing equipment has been 
prohibitively expensive for the widespread application of 
pseudo-random correlation systems, However, due to the 
advances in the ASIC technology, and the digital signal 
processing techniques, practical system implementations are 
becoming realizable. 

A new method of DSP-based correlator implementation is 
discussed- It exploits the structural characteristics of pseudo- 
random waveforms and performs the crosscorrelation of any 
digitized waveform with a reference pseudo-random 
waveform, in a manner, much more efficient in terms of 
processing speed and hardware requirement. This new 
method can be applied for baseband or bandpass waveforms, 
and it can handle a wide range of modulation schemes and 
signaling structures, In order to achieve greater resolution of 
the calculated correlation function, it is possible to compute 
the correlation function for the lag values in fraction of the 
basic chip interval. 
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II. PSEUDO-RANDOM CORRELATION SYSTEMS 

The model of a typical pseudo-random correlation system 
is given in Figure 1. The heart of the system is a correlator 
that calculates the correlation between the pseudo-random 
excitation waveform and the received waveform. The 
correlator implemented digitally is generally much superior 
in performance. The correlator can either be designed to 
perform a periodic (circular) correlation or a linear 
(aperiodic) correlation. Whenever possible, a periodic 
correlation is preferred as it performs better than its linear 
counterpart. 



Figure 1 : Pseudo-random correlation system model. 


Ill EXISTING CORRELATORS 

The earlier correlators were of analog type and their 
performance was relatively poor [1]. The present state-of- 
the-art coirelators are DSP-based, with much improved 
performance. However, the process is still computation 
intensive and requires costly computational resources. Two 
main techniques exist for the implementation of a digital 
correlator. These are, (a) Time-domain delay-multiply-add 
correlator (Figure 2) and (b) Frequency-domain FFT based 
correlator (Figure 3). 



Figure 2: Basic time-domain correlator block diagram. 


- 1 - 








Figure 3: FFT-based correlator block diagram. 


While the time-domain approach is simple and straight 
forward, it performs poorer in terms of computational 
efficiency. The frequency-domain approach is relatively 
efficient, though still slow and expensive. New devices and 
techniques are needed to reduce these costs and permit the 
wider use of correlators and the correlation systems. 

IV. NEW CORRELATOR 

A new method of DSP-based correlator implementation is 
discussed. It exploits the structural characteristics of a 
pseudo-random waveform based on the maximal-length 
sequence. The method can be applied for baseband or 
bandpass waveforms, and it can handle a wide range of 
modulation schemes and signaling structures. In order to 
achieve greater resolution of the calculated correlation 
function, it is possible to compute the correlation function for 
the lag values in fraction of the basic chip interval. The new 
method eliminates various kinds of redundancies in the basic 
correlation process. The correlation operation is broken into 
pieces and transformed into a form where the benefits of fast 
Hadamard transform are utilized (Figure 4). The resulting 
data is regrouped and transformed back to the standard form. 
This method performs most of the mathematical operations 
in the fixed point arithmetic format, thereby saving lot of 
storage space and processing time. 



Figure 4: FHT-based correlator block diagram. 


V. MATHEMATICAL DESCRIPTION 

The correlator calculates the cross-correlation function 
between the received waveform and a reference waveform. 
Let r(n) and s(n) represent the discrete-time versions of the 
received waveform and the reference waveform, respectively, 
each of period M. The waveforms in the vector notation can 
be represented as, 


r=[r 0 r x r 2 ... r M _{[ ( 1 ) 

s — [sq Sj s 2 ... ^_]] (2) 

The pseudo-random reference waveform, s(n), is derived 
from a two-valued maximal-length sequence, c(n), and the 
signalling waveform, p(n). This can be represented as, 

c=[c 0 Cj c 2 ... c £ _,] (3) 

P = [Po P\ Pi ■ • • Pn-\\ ( 4 ) 

l = [c 0 £ °iP c 2£ ■ ■ ■ c l-i£\ (5) 


where, L is sequence length, N is chip length, M is waveform 
length, and U = NL. The non-notmalized periodic (circular) 
cross-correlation of r(n) and s(n), is defined as, 

#*(*)= E r(n)s(n - k) (6) 

<t = 0 

where k represents the correlation lag value in samples such 

that, k ~ 0, 1, 2 (M-l). The cross-correlation operation 

represented by equation (6) is highly computation intensive. 
Each correlation value requires M multiplications and (M-l) 
additions. In order to reduce the computational load, 
equation (6) will be transformed into a form most appropriate 
for DSP-based implementation of the cross-correlation 
function. 

If only the correlation lag values in multiples of the chip 
interval are computed, equation (6) gets modified to, 

u - 1 

(Z £r(n)s(n-Nm) (7) 

n » 0 

where, m= 0, 1, 2, ... , (L-J). By exploiting the structure 
in the reference vector S, as shown in equation (5), the 
summation in equation (7) can be broken down into two 
parts, such that, 

L - 1 ^-1 

<p{m)= Z ^p(n)c(x-m)r(xN+n) (8) 

x = o <1 = 0 


Equation (8) in terms of matrix manipulation can be written 
as, 



4> = C • R p (10 ) 
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where, C is a right eirculaiit matrix whose first row is the 
vector c, and each successive row is obtained from the 
previous one by a single circular shift operation. The 
matrix Jt consists of the. elements of the received vector r, 
arranged in L rows and N columns. Since the elements of 
C are two valued, i.e.. - {A.BJ, where A and B are real 

ntunbers, it is possible to transform C into X through the 
transformation C = faX t bY}, such that x, = /+/, 
Hence, equation ( 1 0) C3n be written as, 

0 s* a ( X Bp) + b ! ) •' ■ R p ) ( 1 1 ) 


and in matrix form as, 
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where, a=jA-Bj/2 and b~{A+B}/2 and )' is an LxL matrix 
with all elements equal to one. The constants a and b can 
merge into vector £. such that g= ag and m~ £>£. therefore, 


(d = X • R g + Y ■ R m_ 


(13) 


The second term in equation (13) results into a column vector 
whose elements have identical values. Hence, its 
contribution to the correlation vector ^tis a constant dc shift 
In most practical applications, the absolute value of the 
correlation function is of little interest and only the 
normalized correlation function is important This implies 
that the second term of equation (13) can be dropped, 
resulting in, 

Pu \ Xu Xi X 2 • • AY. r 
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h jxxi JtS Xl ■ ■ Xl.. 
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-V • Rg_ (15 ) 

The cross-eoirelation operation reduced to equation (14) 
is the first step in the reduction of computational load. Since 
the elements of A' are all +7-1, only additions and subtractions 
are required to perform the matrix multiplication. Finding 
each element of the correlation vector ^ now requires (M~l) 


additions and only N multiplications instead of Si. 
Considering that, in general, M is much larger than N. this 
is a significant reduction of the computational load. Ft is, 
however, possible to reduce the computational requirement 
even further by exploiting the fast Had amard transform [2]. 

Fast Hadamard Transform A Hadamard matrix is defined 
recursi vely as, 

H = 1 

i 

H H 
H - 1 

2i H - H 

i i 


(16) 
(17 ) 


Only orders 2*, where k is a positive integer exists. A matrix 
of order 8x8 w ould be, 
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( 18 ) 


The Hadamard transform is defined as, 

y. ~ ff x 09 ) 

The input vector x is transformed into vector v through the 
Hadamard matrix multiplication. Based on the structure of 
H. it ts possible to, digitally implement this transform very 
efficiently. The flow graph for an 8-point fast Hadamard 
transform :(FHT) is shown in Figure 5. 


,x, , xr , — : x, 


basic butterfly element 



Figure 5: Flow graph for 8-point fast Hadamard transform. 


- 3 - 


This flow graph is identical to that of fast Fourier transform 
except that there is no multiplication involved in the present 
case [3]. 

Application of FHT to Correlator The FHT algorithm can 
not be directly applied to the cross-coirelation operation 
described in equation (14) because X is not a Hadamard 
matrix. However, it is possible to transform X into the 
required form by the substitution, 

X = P ■ S ■ H ■ S ■ P (20) 

7 7 11 

where, P, and P, are permutation matrices whose purpose is 
to permute the rows and the columns of H. In general the 
size of X is LxL, where L= (2*-l), while H can exist in sizes 
(L+l)x(L+l) only. Therefore matrices S, and S, are required 
to transform the size of H to that of X, by suppressing the 
fust column and the row of matrix H. Equations (15) and 
(20) result into, 

fi = (P (5 (H (S (P •*))))) (21) 

— 72 II 

where the parentheses indicate the sequence of operation 
leading to the following interpretation. The measurement 
matrix R is permuted according to P, and a row of zero 
elements is affixed to the beginning of the matrix. The 
resulting matrix is transformed by the fast Hadamard 
algorithm. Then the first row of the resulting matrix is 
dropped and the rest of it is permuted according to P } . 
Finally the correlation vector is obtained by the multi- 
plication of the previous results with the gain vector g. The 
correlation operation defined by equation (21) is now in the 
most appropriate form for an efficient digital implementation 
either in software or hardware. The calculation of each 
element of the correlation vector now requires about N.log(L) 
additions and N multiplications. 

Fractional-Lag Correlation The set of correlation values 
obtained through equation (21) correspond to the correlation 
lags in integer multiples of a chip duration only. In certain 
situations it is desirable to calculate the correlation function 
for lag intervals smaller than the chip interval. In order to 
calculate the fractional -lag correlation values, a slight 
modification to the received vector r is required prior to its 
decimation into matrix R according to equations (9) and (10). 

In particular, if the received vector r is given a left circular 
shift by one position, such that, 

£ ' =t 'l r 2 r 3 V r 0 ] (22) 

and, 



The resulting set of correlation values will correspond to 
the fractional-lag of {k+l/N, where k is an integer j of the 
basic chip interval. Similarly, by performing multiple 
circular shift operations on vector r, correlation values for 
lags in all integral multiples of 1/N can be calculated. 

VI. HARDWARE IMPLEMENTATION 

The proposed correlator can be implemented in hardware, 
either on a general purpose DSP board, or on a dedicated 
plug-in board based on various ASIC chips. In the later case, 
the digitizer can also be included on a single specialized 
board (Figure 6). 



Figure 6: Single Card Correlator 


The digitizer determines the size of the input RAM, and 
also the quantization noise level. The maximum sampling 
rate and the quantizer size is dictated by the specific 
application. However, 12 bit quantizer with 50 MHz 
sampling rate is a typical figure. The input RAM stores the 
received waveform r. The decomposition of r into matrix R 
and the permutation of vectors R t do not require any separate 
processing. These operations can be performed during the 
read or write cycle for the input RAM, without de mandin g 
any extra time. 

In a similar manner, on the output side, the permutation of 
output vector is also performed during the data storage cycle. 
The size of the output RAM depends on the number of 
stages in the fast Hadamard transformer which in turn 
corresponds to the order of maximal-length sequence. 

FHT Transformer The basic element of a FHT 
transformer is a two-input / two-output fixed point adder / 
subtractor (Figure 7). 

(stage-1) (stage-2) (stage-3) 

8-bit add/sub 9-bit add/sub 10-bit add/sub 



Figure 7: FHT Transformer 
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A group of adders / subtractors is arranged into various 
stages. There are (L+l)/2 adders/subtractors in each stage, 
where L is the length of the corresponding maximal-length 
sequence. The total number of stages is also a function of the 
sequence length. The size of the two inputs to the 
adder/subtractor depends on, 1) the quantizer size and 2) the 
number of stage to which the adder/subtractor belongs. In 
general the input data size progressively increases with 
increasing stage number. 

VII. SOFTWARE IMPLEMENTATION 

The software implementation of correlator consists of, (a) 
main program, (b) FHT subroutine, and (c) permutation 
subroutine. The permutation subroutine (Figure 8) generates 
the permutation matrices P, and P 2 . For a given maximal- 
length sequence, these matrices can be generated once and 
stored. Hence, repeated correlations corresponding to the 
same maximal-length sequence do not require that the 
permutation subroutine be called each time. 



Figure 8: Permutation subroutine. 


The FHT subroutine (Figure 9) is the core processing 
function of the correlation process. The main program 
(Figure 10) calls this subroutine several times for calculating 
a single set of correlation values. The correlator program was 
implemented in Matlab and its operation tested and verified 
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Adder/Subtractor 
x = (a+b) 
y = (a-b) 

no x / r^iriV yes 

▼ 

Output = (x y] 


Figure 9: FHT subroutine. 



Figure 10: Main correlator program 


Vin. PERFORMANCE COMPARISON 

A rough comparison of the performance of proposed 
correlator implementation with other standard 
implementations can be readily made. If L is the length of 
the ml-sequence that constitutes the transmit waveform and 
there are N samples per symbol. One period of the waveform 
will consist of NL samples. Hence, the basic time-domain 
correlator will require NL real multiplications and NL real 
additions, in order to calculate one correlation value. 


_L 


g - [x y] 
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The FFT-based approach requires, (a) one FFT of length 
NL, (b) one IFFT of length NL, and (c) NL complex 
multiplications. These numbers, however, correspond to all 
the NL correlation values. Assuming that, 

(a) one FFT requires NL.logJNL) complex 

multiplications and about the same number of 
complex additions, 

(b) one IFFT requires NL.log,(NL) complex 

multiplications and about the same number of 
complex additions, 

(c) one complex multiplication requires four real 
multiplications and two real additions, 

(d) one complex addition requires two real additions. 

This gives us the average of 8xlog : (NL ) real 
multiplications and 8xlog 2 (NL) real additions for each 
correlation value. The proposed approach, on the average 
requires only N real multiplications and N log/L) real 
additions for each correlation value. In general, L is much 
larger compared to N and therefore, the new approach is 
mostly dominated by additions. The following table 
compares the computational requirements of the proposed 
approach with the existing approaches, taking 1=1,000 and 
AMO. 
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Correlator Type 

it of real 
multiplications 

# of real 
additions 

Basic time-domain 

10,000 

10,000 

FFT-based 

no 

108 

Proposed approach 

10 

100 


Table 1 : Processing requirement per correlation value. 


DC. CONCLUSION 

A new computationally efficient method of DSP-based 
correlator implementation is discussed. This method is 
applicable for pseudo-random correlation systems employing 
maximal-length sequence in a continuous mode. The new 
method exploits the structural characteristics of pseudo- 
random waveforms and performs the crosscorrelation of any 
digitized waveform with a reference pseudo-random 
waveform, in a manner, much more efficient in terms of 
processing speed and hardware requirement. This new 
method can be applied for baseband or bandpass waveforms, 
and it can handle a wide range of modulation schemes and 
signalling structures. In order to achieve greater resolution of 
the calculated correlation function, it is possible to compute 
the correlation function for the lag values in fraction of the 
basic chip interval. 




